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Two cDNA clones containing the complete protein-coding 
sequence of 1,188 nucleotides as wall as the 5' and 3* non- 
coding regions of human prostatic acid phosphatase (PAP) were 
isolated and sequenced. The size of PAS; mBNAs from benign 
prostate hyperplasia and cancerous prostate was estimated to 
be 3,2Kb, indicating that the 3 1 downstream polyadenylation 
signal was used. Several genomic clones containing parts of 
the human SAP gen* were isolated and the nucleotide sequence 
or" ten exons and their flanking regions was. determined. The 
protein- coding sequence of the human PAP gene was interrupted 
by nine introns- The positions of all nine introns present in 
the human .PAP gene were horaologpus to those of the first nine 
introns in the human lysosomal acid phosphatase (LAP) gene. 
However, the last (11th) exon of the LAP gene encoding the 
COOH-terminal domain, which includes a transmembrane segment r 
was found to be absent in human PAP gene. Southern blot 
analysis of ten mammalian genomic DKAs gave multiple EcoM 
fragemeiits. The data of human genomic DNAs were consistent 
with the total length of the PAP gene 03? at least 50 
Jcilnbasea. o 1992 * C fni*»ic mc* 



Acid phosphatases (orthophosphoric monoester 
phosphohydro lase r EC 3: 1,3,2) are a group of enzymes capable 
of hydrolysing phosphomonoesters under acidic conditions. 
They can be differentiated according to their immunological 
properties r tissue distribution and subcellular location (1) . 
Prostatic acid phosphatase (PAP) is synthesized under 
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androgen regulation by epithelial cells of the prostate and 
secret eel into seminal fluid. Human PAP has been used for over 
50 years as a tumor marker to diagnose patients and prognose 
with prostate career (2-3) . The physiological function of PAP 
is still not well understood, hut recent reports suggest that 
PAP exhibits protein tyrosine phosphatase activity (4-5) . 

In order to study protein s tincture -function and the 
mechanism of androgen regulation, we have undertaken protein 
sequencing, cDNA cloning and isolation of the human PAP gene. 
We previously described the amino acid sequence and antigenic 
structure of ths secretory pap enzyme (&-?). we also cloned 
the partial PAP cDiJA sequences and mapped the pap gene to 
chromosome 3 (6) . Here we report the isolation .of a nearly 
full-length cdna and characterisation gf exon-intron 
organisation of the human pap gene as well as the results of 
Northern and Southern blot analyses. 

MATER TRT,.q AWH MttTwnnfl 

r " . 

Clonlner rvf fun -1 gfiyt-h rnt^Ar 

In order to isolate the cDNA clones containing 5' non~ 
coding and protein- coding regions, the EcoRI*-EcoRX and 
EcoRI-tfind III D1«A fragments (Fig. 1) of the human PAP oDNA 
clone hP4 0 reported previously (6) were labelled with (a- 
32P)dCTP and used as a probe to screen a prostate lambda gtll 
cDNA library (Clontech) uaing standard procedures (8) , One 
DNAs purified from positive cDNA clones were partially 
characterized by restriction eadonuclease mapping and 
Southern blotting. Their EcoRI D3>JA fragments were subclones 
into Mi3rapia phage and the nucleotide sequences of the 
inserted Okas were determined - by the dideoxy. chain 
termination method with the sequencing protocol using 
deoxyad«nosine S'-Ux-SSS) thiotriphosphate 1 (9) . 

Isolation 'of genomic cl^^ 

Human genomic clones containing parts of the PAP gene 
were isolated from a total lymphocyte genomic DMA library 
(Stratagene) using the cDNf a probes of clone hP4Q (6,8). 
Genomic clones containing the exon 1 and its upstream 
sequence of the PAP gene were subsequently isolated usijag 
cdna probes of 5' coding-region froni clone hP303 {see Fig. 
1) . The DNAs purified from positive g&nomic clones were 
characterized by restriction endonuclease mapping and 
Southern blotting. The DNA fragments exhibiting positive 
hybridization to the cDHA probe were subcloned into 
Ml3mpl8/13 phage or Bluescript phagemid. The' nucleotide 
sequences of the e*ons and tHeir flanking regions were 
determined by dideoxy chain termination method using either 
universal or exon-specific primers (9) * 

1AM 



PAGE 12127 * RCVD AT 6(712 0044:10:50 PM [Eastern Daylight Time] ' SVR:USPTO€FXRM/0 * DNIS:8729306 * CSID:570 895 2702 * DURATION (mnvss);!^ 



06/07/04 15:20 FAX 570 895 2702 AVENTIS KNERR 

i 

f ITS68 ON XM/XXJ ^Z-fX OBI frO/frO/EO 



2)013 



Vol. 184, No. 3, 1992 



BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS 



i 
* 



k 



Hf! 



Wori-harn blot analvsis: 

Tissues of benign prostate hyperplasia and cancerous 
prostate from patients were kindly provided by Drs . Y. 
Sharief and J. Mahler, and were quickly frozen in liquid 
nitrogen, and total RNAs were extracted .by the .guanidinium 
thiocyanate-pbenol-chloroform method using the RNAzol B kit 
(Biotecx Laboratories f Inc.). Poly (A) +RNAS were purified by 
oligo(dT) cellulose using the FastTrack kit' (Invitrogen 
Corp.) . Two Micrograms of poly (A) +HNAs were electrophone sed 
on a 1.0% agaxose/2.1% formaldehyde gel, transferred to a 
Nytran membrane (Schleicher & Schull) by capillary blotting 
with 10X SSC (1..5H sodium chloride/0 . 15M sodium citrate / 
pH7) r and UV-cross-1 inked as described (S) * The Northern blot 
was hydridi2ed overnight at 42 C in 50% Formamide/ 5X 
Denhard'a solution/ 5X SSPE (0.75M sodium chloride, SOniM 
sodium phosphate, 5mM EDTA r pH7.4)/ 1% £DS/ ' salmon sperm DNA 
(200 ug/ral) with randomly 32P-labelled PAP cDNA probe 
containing no Alu-sequenee , The filter was rinsed twice at 
room temperature for 15 rain each and washed twice at 65 C for 
1-5 Mill each : with IX SSPE/ 0-1% SD5* Autoradiagrams were 
obtained by exposing the blot to Kodak film with 

intensifying screen at -70 C for various, lengths of times * 

The Southern blot containing EcoRI fragments of ten 
mammalian genomic DNAs was obtained from BIOS Corporation and 
hybridized using 32P-labe-lled PAP cDNA probe containing no 
Alu-sequence according to the procedure recommended by the 
supplier. 



KESTTLTS 
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HnclMtidP H^rynenfrw of PAP ^nTOB^- 

Two cDNA clones hP303 and hP317 were isolated and 
partially characterized (Fig. 1). The nucleotide sequence of 
both strands of the inserted DNA from clone hF303 was 
determined ajad found to consist of complete protein-coding 
Sequence of 1,188 nucleotides as well as 5 r (42 nucleotides) 
and 3' (1136 nucleotides) non-coding regions. This PAP cDNA 
sequence was consistent with. that reported previously (6,12) . 

Characterisation nt PAP g*n»? 

- With the use of the CDNA probe of clone hF40, several 
genomic clones containing parts of exons 2 through 10 were 
isolated (Fig. 2) and the nucleotide sequences of these nine 
exoiis and their flanking regions were determined. The 5' cdna 
region, including exon. 1, of the newly isolated clone hP303 
was subsequently used as a probe to reacreen the genomic DNA 
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Flcy. 1. Restriction endonuclease map of human Eae cdHa. 
clones « 

The restriction sites given CR, EcoRI? R*, EcoRI star 
activity; P, Pst I; 3, Hind III) a*e those \13ed in nucleotide 
sequencing and preparation of hybridization probes; The 
protein-coding sequence is denoted by an open box- The 
dimeric and jsonoraerio Alu-repe&ts are shown by hatched boxes. 
Clones hP7 and hfc40, which were reported previously (6) , 
contain- poXy(A+) -tail of 51 and 20 A's, respectively,, and 
their 5' sequences were presumably truncated by an EcoRI Star 
activity during construction of the CD HA library. Clones 
hP303 and QP317 were isolated in this investigation * 
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Flo, 2.. The exon-intron organization of the human PAE gene 
along with restriction endonuclaase sites, 

The pcotein^coding and 3 ' noneoding regions of the 
PAP cDNA are indicated by open and hatched boxes, 
respectively. The exons in the. PW gene are numbered in 
Arable and shown by solid-blocks . The exact sizes of nine * 
introns are not known, and th« intron between exon 1 and 2 
was not* overlapped by these isolated genomic clones. Only 
those restriction sites <& f EcoRty K, Kpnl; X, Xbal) which 
were used to isolate DMA fragments and to subclone are given. 
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library. and five identical clones were found to contain exon 
1 and its 5 1 Upstream region (Fig. 2) - The nucleotide 
sequence of ten exons of the PAP gene Was identical to the 
PAP cDNA sequence (Fig. 3) . 

The protein-coding sequence of the pap gene waa 
interrupted 'by nine introns- Exon 1 encodes a leader peptide 



a^at&aaagtaaactgaaaacaaggataguatgtctBctaautgctct^ 
tctctwgctcaaaattUcatgaratUtttarataceaaacctgteeta^^ 

ttcccactacaMCttttgaagttetAGCAfinCCTCCTM^ Exon X 

HRAA PLLLAR AA 5I.3L6 

TTTlTTGTTTCirStTTrTTTTtCTGt^AfiTOQAACTGTACTAGCCAAGGAGn „ . „ . ,ctart tteaqdQTTTWGCATGGASACCGA Exun 2 

FLFLLFFKLDRSVLAlfELK-FYTL Intron \ V F R H fl □ R 

-I+l 8 9 

A^KAnGAiaCCTnCCCACTGACCXATAAAGGM^ „ „ . mttcaLaqCTGGGCATG6A6CAG Exon 3 

5PIDTFPTDPTKC5SV-PQtiF69L.ro tntron 2 L G H E Q 

WTTAT6AACTTGSAGACTATATAAGAAA(jA1iATATA^ „; „cctactCtagGTTTATATTCEAA(|CACft 

H Y E l £ E V I RKRYflKFLNESYKHeq Intron 3 V r 1 ft S t 

69 7a 

toCm^CGGACTnGATGAGTGCTAIGACAAACa £* on 4 

DVDRTLMSAKTWLAALFPPESVSIWMPILLWOPIPVHTVP 

CTTTCTGAAGATCAGgtcagtatac ttatcctcagRGCTATACCTGCCmCAtH^^ E*en 5 

L S E 0 g Intron 4 LLYLPFRNC'PRFQELESETLtfSEEF 

120 121 

CAGAAQAGGCTGCACCCTTATAAGgtt Uoogct bgtg 1 1 tragGATmATAGCTttCnGGGAAAACTT^^ Exqp 6 

a^ALHPTC' intron 5 DPI ATLQtCLSGLHGQOlFfLWS 

153 154 

AAACTCTACGACCCTTTATATTCTCAttgtiMajBaa «..«tttgetaa^fiTGTrCACAATTrCACmAC[rTCa6GGIXAn Ewm 7 

It" tf Y D p -L .Y C e Intron S SVHIIFT. UPSWAT^pTHTJCLftE' 

IB* ies 

nGTeM^rttTtMCCTGTnXTCTATCI^ fesaggtatt tttttttaflgGGTOTClTTEtJTCAATGAAATC EXon 8 

LSELSLL5LYGIHKQKEKSALQG Incrqn 7 G V L V N £ l 

2211 229 

CTCAATCACATCAAGAGAfirAACTL^G ATACCAAGCTACJ ARMACTTaTCaT GTATTCTQCGg taag ta M t, tet ct^t tt$CAT6ACACT ACTGTGAGTDGCCTACAC Exon 9 

LNHMKRATqiRSYKtCVCKYSA. IfltW 9 HQTTVSGL0 

.. 256 ZS7 

ATCGCGCTA WTGmACAACGGACTCCTTCtTttCtATGCTTcnwCACnGACGGAA tQflCt tfl tc to ccagGGACTACTTTCTG 

HALOTlUfrLLPP YASCHLTEL^FE KG iritfflfi 9 £ Y F V 

Z91 M2 
GAGAT^ACrAKGGAATGAGAraCAerjvcGA^™ ^ Qn 10 

EHYYRNETOHEP.VPLMLPGCSP3CPLCAFA^LV«PV-|P0D 

TG GT CCACBG ACT(TTATG4CCACAAa.C4(hCOtf€M 



*5TECMTTN.$HQGTED 



S I D " 
3§4 



Fi< r- 3, Nucleotide sequences of the protein -coding exons and 
their flanking regions of the human. PAP gene _ 

Hie nucleotide sequences of tea exons as well ag c&NA 
are given in upper case, while those of flanking sequences 
are in lower ea$e. The 5' non-coding region Of clone hf-303 

.contains AZ nucleotide, and the exact 3ite of transcription 
initiation remains to be determined. The sizes of exons 2 
through 9 are 96 r 90, 153, 9$, 93, 132, 96, and 104 ' 
nucleotides, respectively . The 3| non-coding region of 1,513 
nucleotides present in cDNA sequence was reported previously 
(6],.. and only partial sequence determined in genomic DNA is 
presented, The deduced amino acid sequence is shown below the 
coding-exons, and is numbered" from the fir3t residue of the 

.secretory enzyme. The 32 amino aaids in the leader peptide 
are given in negative numbers*- The translation termination 
codon TAG is denoted by *, it should be noted that a Cys 

'instead. of the previously reported'val (6) was identified at 
amino acid 340 Of the secretory enzyme.' 
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of 32 amino acids and tie first eight amino acid3 0 f t ha 
secretory enzyme as well as a 5' non-coding region of at 
least 42- nucleotides. Sxons 2 through <> from 90 tQ 153 

nucleotides in S i ze . ^ last (10th) m ^ COqh _ 

terminus of 6* amino acids ' and 3- non-coding region of 
649/1916 nucleotide*, depending on the polyadenylation site 
All exon/intron junction- sites were consistent with the 
conserved gt/ag sequence, e3£C e P t gc/ag for intron 3 . The 
first eight introns of the PAP gene are cl asa -0 introns 
located between codons. The ninth intron is a class-ll intron 
located between 2nd ana 3rd bases of codon 291. The length of 
these nine introns wa 3 not precisely determined, and the size 
of intron 1 was relatively, large and not overlapped by these 
isolated genomic clones. The total length of the human pap 
gene was estimated to be at -least 50 kilobases. 

E XPrflflflinn of p-bp r . n lyf7U.+t™ra f1 . 

The results of Horthern blot analysis of poly (A) +RKAs. 
isolated from benign prostate hyperplasia and canerous 
prostate are presented in Fig. 4 . The size of the most 
predominant PAP ntRNAa was estimated to be 3.2Kb, although 
weak aingal of smaller hha 3 vas detected in the 3 pecijuen of 
cancerous prostate. 

Southern Mnr ^nl^ii «r n, amm fl 1 { n T1 „^ m ^ . 



The Ecdm fragments of mammalian genomic DNAs were 
analyzed using pap nDNA probe contianing no Alu-seguence 
(Fig. 3) . Human genomic dnas exhibited multiple BcoRl 
• fragments, and most of the other mammalian DNAs gave three 
hybridization hands. - . 

DTSrrngSTflrc 

in our previous report (6), all pap cm,^ were truncated 
•t 5 • p«, teill -coaiag region because of the presence of EcoRl 

construction of eDHA library, in 
this investigation, two newly isolated clones hP31-7 and hP303 
contain 5 ' noncoding (42 nucleotides) and complete protein- 
cod^g (1136 nucleotides) sequences in addition to partial 3- 
noncoding «gion. The reported heterogeneous slzes (M6 ^ 
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CAP BPH 





' 0 iftl 

Fig- 4. northern blot analysis of poly(A)+E«As t root, human 

prostate tissues, 

Tissues: CAP, cancer on 5 prostate; BPH, benign 

prostate hyperplasia, The Northern blot was probed with Pap 

cDNA Containing no Alu-seqgence as well as human GAPDH cDNA. 

The size of qAPDfl mSNAS wag eatiattted to be l.^Kb 

predicted (data not shown) - . 



as 



Fla. 5 r Southern hlot analysis of mammalian genomic DNAs. 

Lanes 1, dog; Z, cat;. ; 3, rabbit; 4, cow; 5, sheep; G r 
mouse; 7, fat; 8/ hamster; 9; pig; and 10, human. The 
autoradio^raph of lanes 1 through 9 was exposed for 4 dayd, 
while lone 1Q was exposed for 13 tars. 



1913 nucleotides) of 3' noncoding sequences were due to the 
alternative polyadenylatiori signals used. Northern blot 
analysis estimated the size of PAP mRNAs from benign prostate 
hyperplasia and canerous prostate to be,3*2Kb. These results 
indicate -that the predominant iniRNAs resulted from the usage 
of the 3* downstream polyadenylation signal - 

The PAP CDNA sequence from th^ newly isolated clone 
hP303 was ' found to be identical to that of the 10 exons 
present in the human PAP gene. This cDN& sequence is also 
identical to that reported by van Etten et al. (10) . The 'PAP 
cDMA sequences described by Viiko et al, (11) and Tailor et 
ai,(12) exhibited some nucleotide differences, wMch might be 
due to genetic polymorphism and/or sequencing errors . The 
deduced amino acid sequence, of PAP contains a leader peptide 
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of 32 residues and secretory enzyme of 354 amino, acids as 
described previously (6 f 10-12) . 

The amino acid sequence of human PAP was reported (6) to 
exhibit 50% (177/351) identity to the corresponding sequence 
o£ human lysosomal acid phosphatase (LAP) . The protein -coding 
sequence of 423 amino acids from the human LAP gene was shown 
to be interrupted by 10 introns (13) * The positions of eight 
introns (1 to 6, 8 and 9) present ia the human PAP gene were 
found to be homologous to the sites of corresponding introns 
from human LAP gene. The seventh intron of pap gens is 
located between codoxi no. 260 and 261 (Gly-Gly at amino acid 
no. 228-229 of secretory enzyme) r while that of the LAP gene 
is located within codon 257 (equivalent to amino acid 229 of 
the secretory PAP enzyme ) . This difference could be due to 
the difficulty of sequencing the string of six G's in the LAP 
gene (13) * The 10th intron of the LAP gene was located within 
codon 380, which corresponds to codon 392 of the PAP gene 
(amino acid 350 of the secretory PAP enzyme) , The last (11th) 
exon of the LAP gene encodes the 43 COOH-terminal amino 
acids, including a transmembrane segment, which is absent in 
the PAP gene, Xhe PAP and LAP genes f which are located on 
human chromosome no. 3 and 11, respectively, evolved from 
duplication and divergence of an ancestral acid phosphatase 
gene, xt has been suggested that the ancestral exon 11 was 
lost by deletion in the human PAP gene <13V. Alternatively, 
the exon 11 of the Lap gene could have been added after gene 
duplication-* 

Southern blot analysis of ten mammalian genomic DNAs 
(Fig. 5) indicated multiple EcoRI fragments of human PAP 
gene, Thi3 is consistent with the exon-intron organization of 
human pap gene (Fig. 2) elucidated in this investigation. 
Whereas most of the other mammalian DNAs gave three bands, 
indicating the presence of at least two Egofcl sites in their 
PA£ genes . 

The synthesis of human PAP hy .epithelial cells of the . 
prostate is induced by androgen . (1) . Xhe putative promoter 
region of human PAP gene remains to be characterized and the 
investigation -on the molecular mechanism (s) of androgen 
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dependency of prostate cancer <14) ^ 
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